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Abstract — A new family of codes based on polar codes, soft 
concatenation and list+CRC decoding is proposed. Numerical 
experiments show the performance competitive with industry 
standards and Tal, Vardy approach. 

I. Introduction 

Polar codes were invented by E. Arikan in 2008. They are 
the first coding system possessing, on the theorem level, the 
convergence to Shannon limit for code length oo, as well 
as fast encoding/decoding algorithms with complexity bound 
0{N \og2 N). Thus polar codes are a significant theoretical 
result. 

On the other hand, the performance of polar codes in their 
initial form presented by Arikan, is considerably inferior, for 
a fixed code length, to other coding systems used in various 
communication system standards f5\. In order to improve the 
polar code performance for moderate block lengths, there 
were made some proposals based on the idea of combining 
polar and various other codes 1*21, (3], f4\. However only the 
system Polar + List + CRC introduced in Tal, Vardy 141 seems 
to have the potential of outperforming the coding systems 
currently used in industry. This system in our opinion has 
some drawbacks, an important one being the significant growth 
of decoder complexity for large list lengths, e.g. L = 32. 
Important theoretical complexity bound 0{LN log N)f^ does 
not adequately show large overhead of this method which can 
be seen in practice. Thereby construction of coding schemes 
of significantly lower complexity with comparable or better 
performance than those of [4] is an important problem. In 
this paper we propose a method of construction of high- 
performance polar coding based schemes which possesses 
decoding complexity lower than that of |4|. Our approach is 
a combination of soft concatenation |6| and list decoding idea 
supplied with some fast (CRC) check of candidates. The paper 
contains only the schemes themselves and some numerical 
experiments. Justification, optimal choice of parameters and 
thorough comparison with other methods are left for future 
work. 

The paper is organized as follows. Section 2 contains 
description of soft concatenation schemes |6|. Combination of 
these schemes with idea pursued in Tal, Vardy |4| is presented 
in section 3. Section 4 contains the results of numerical 
experiments obtained for the new coding scheme. 



II. Soft concatenation scheme 

In this section we consider a method of performance im- 
provement for polar codes |6| in which short classic error 
correcting codes are used together with polar codes. 

Let Ci , C2 , . . . , Cq be a set of linear codes of equal length 
M. Let Ki be the number of information bits in the code Ci. 
Let V be some M x N matrix each of whose elements is 
or 1. Denote by Vji with < i < and < j < M the 
elements of V, by its row j and by Vi its column i. For 
all i — 0,N — 1 choose some integer in the range 1 to 
q. We consider only such matrices V whose columns Vi are 
codewords of Ca,, i.e. 



O.A^- 1. 



(1) 



Consider an arbitrary polar code of length A^ and rate 1, i.e. 
without redundancy, with matrix generator G G GF(2)^^^. 
Encode each row of V with this polar code obtaining a new 
matrix X e GF(2)*^'<^: 



X = VG. 



(2) 



If the matrix X is "reshaped" into a row, one can consider 
the set of all such possible rows subject to restriction ([T]) as a 
linear code of length M ■ N and rate 
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Thus obtained linear code we will call the soft concatenated 
polar code. Let Y be the matrix received after the transmission 
of X through the channel and let be its row j. The 
decoder works by applying alternatively the steps of successive 
cancellation method for rows of Y and maximum likelihood 
decoder for its columns. 

In order to decode the column wq, compute for each row of 
Y independently the logarithmic likelihood ratios 

Pr{yJ"|i>,.o = 0} 



In- 



Vv{yi\vjs) = 1}' 

just like in the usual successive cancellation method. Then 
the values L{vj^o), j = 0, A/ — 1 gathered in a vector y are 
given as input to ML-decoder for the code Cqo. The most 
likely codeword w e Ca^ produced on output is taken as an 
estimate of vq. 



Next we compute the estimate of Vi. Assuming vq already 
known, again compute for each row independently the LLRs 

. _ Pr{y^>jM =0; vj.o} 

concatenate the values L{vj^i) into a vector y, which will 
be the input of ML-decoder for the code Ca^. The obtained 
codeword is taken as an estimate of vi. Next, assuming vq 
and vi exactly known, compute the estimate of V2 etc. 

Note that the polar codes are a special case of concatenated 
polar codes for M = 1, g = 2 and Ci = {0}, Ca = {0, 1}. In 
this case, bit i is frozen if = 1, and it is information bit, if 
Oi = 2. 

Let Ei be the error probability for estimation of column i 
under the constraint that all previous columns were estimated 
error-free. Write the upper bound for block error probability: 

AT-l 
i=0 

Fix some symmetric channel W, set of codes Ci , . . . , 
of length M, polar code of length N and rate 1. We require 
to construct a concatenated polar code of given rate k/N, i.e. 
choose numbers ao, ai, . . . , Um-i such that 

2=0 

We will choose these numbers so as to minimize the upper 
bound ([3]). Denote by the error probability for estimation 
of the column Vi under the constraint that all previous columns 
were estimated error-free and at — k. Note that E^ does 
not depend on aj for all j ^ i. For a concrete choice of 
ao, ai, . . . , oat-i we can write the following upper bound for 
Pe, 

N-l 

Pe<Y.ET- (5) 

Assume for now that for all i = 0,iV — 1 and k = l,q 
we can compute E'^. In this case, the optimal values of 
ao,ai, . . . ,aN-i, can be determined using the dynamic pro- 
gramming method 161. 

Since the channel is symmetric and the code is linear, 
we assume the all-zero codeword is sent. Suppose that the 
columns Wq, "fi, • . • , fi-i have been estimated correctly and 
the decoder is to estimate Vi. Next the ML-decoder for the 
code Cfe takes on input the vector 

A = [L{vo,i), L{vi^i), L{v24), ■ ■ .,L{vm-ia)]- 

For convenience, introduce the notation Xj = L{vj^i). The 
components of A are i.i.d. random variables. Their proba- 
biUty function (or pdf) fi can be computed approximately 
||6l. We can assume that the column vi is transmitted via 
some symmetric channel with LLR distribution fi. Thus the 
problem of computing E^ is reduced to the estimation of 
error probability for the ML-decoder on a channel with given 



probability function fi. It is well-known that the ML-decoder 
minimizes the linear functional 

3=0 

where c = [cq, ci, . . . , cm-i] runs over all codewords of the 
code Cfe. For the all-zero codeword the functional 4> is zero. 
Hence if the decoding error occurs, there necessarily exists 
some codeword c' such that 0(c') < 0. The last inequality can 
be rewritten as the sum of wh{c') terms, 

j tz SUpp C' 

Some nonzero codeword c' will be strictly more preferable 
than if 4>{c') < and in this case the decoder error will 
surely occur If ^f)(c') = 0, the decoder may choose the 
correct codeword among those which zero the functional cj). 
For simplicity assume that (f>{c') — also implies the decoder 
error. Write the probability of the event that for a fixed c' the 
inequality 0(c') < holds as 

Pr| E A,<oi. 
l_ J e supp c' J 

The sum consists of wh{c') i.i.d. random variables with the 
probability function fi, therefore the probability function of 
the sum is 

— f — ' 

wh{c') times 

It follows that the probability of the event </>(c') < depends 
only on the weight w of the codeword c' and it can be written 

as 

P{f,w)^ E Z^'^^^)- 

a:(isupp f*^' : a:<0 

The main contribution in the error probability is made by 
codewords of minimal weight. Let dk be the code distance of 
the code Ck, and let be the number of different codewords 
of weight dk in the code Ck. Then the probability Ei may be 
estimated as 

E'y^mk-P{f,,dk). (6) 

III. New scheme description 

Our proposal is the combination of soft concatenation 
schemes and Tal-Vardy method. We start from a special case 
when the dimensions of the matrix X Q are powers of two, 
iV = 2", M = 2". 

Each column of the matrix X will be connected with some 
polar code chosen in a way which will minimize the block 
error probability. Besides, each column of the matrix X can 
contain certain number of bits (e.g. 4, 8 or 16) reserved for 
CRC. The choice of exact number of those bits we leave as an 
open question. In our experiments, we use CRC-4 check for 
every column. The decoding procedure for such code consists 
in alternatively applying steps of successive cancellation for 



rows of the matrix Y and steps of List+SC+CRC for its 
columns. Therefore each column has also a corresponding list 
size as a parameter which is used in decoding. It is quite 
obvious that channels corresponding to columns of X with 
"better" statistical properties should correspond to smaller list 
sizes. It is not difficult to show that the decoding complexity 
will be 

0(7VM(Lav log M + log iV)) (*) 

where Lav is the average list size. 

One may think that the estimate (*) is quite similar to 
0{LNM \og{NM)^, where L is the maximum Ust size. How- 
ever due to channel polarization we have in general Lav < L. 
Besides, the new scheme possesses some natural parallelism. 
Indeed, column bits for each step of row-wise successive 
cancellation may be processed simultaneously and the decoder 
required for this operation works with codewords of signifi- 
cantly smaller length. Using the parallel construction one can 
reduce the decoder complexity to 0(A^(log N+Li^^M log M). 

In a more general scheme, the columns of the matrix X 
represent the codewords of a different code family. In ||6| 
(see also section 2) we have used the codes of length 32. For 
each of those codes, the maximum likelihood (Viterbi) decoder 
was used. Alternatively, one can use list Viterbi decoding, see 
e.g. Q. Using additionally CRC bits we obtain a construction 
where a variety of codes is used as internal codes achieving 
thus better performance. Theoretical study of these issues goes 
beyond the scope of this article. 

We note in addition that a scheme containing list decoding 
at the first step is quite possible. However the complexity of 
the resulting decoder generally has a factor L^. Nevertheless 
the idea of LDPC codes as internal ones seems to us very 
attractive. 

IV. Numerical experiments 



III we have 



Using the approach described in section 
constructed two codes with = 16, M = 32 and rate 
values 13/16, 3/4. The average list size used was Lav — 8. 
The experiments show that the new code performance is 
comparable to existing industry standards as well as to the 
Tal-Vardy scheme. This suggests that the improvement of the 
proposed technology will result in quite competitive codes. 

One can notice that for large SNRs the error rate of the 
proposed scheme becomes worse than that of other tested 
codes. At present, we cannot say whether this effect is intrinsic 
to the proposed scheme or it can be removed with increase of 
outer codes length or some other optimization. 
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Fig. 1. Comparison of performance of original Ankan code, Tal-Vardy 
scheme {L = 8) and proposed scheme for code length 512, rate 13/16 on 
an AWGN channel 
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Fig. 2. Comparison of performance of an LDPC code, 802. Had standard 
and proposed scheme on an AWGN channel. First two codes are (672, 546), 
while the third one is (512, 416). 
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Fig. 3. Comparison of performance of original Arikan code, Tal-Vardy 
scheme (L = 8) and proposed sciieme for code length 512, rate 3/4 on an 
AWGN channel. 
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Fig. 4. Comparison of performance of an LDPC code, 802. 11 ad standard 
and proposed scheme on an AWGN channel. First two codes are (672, 504) 
while the third one is (512, 384). 



